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In many situations, the decision maker observes items in sequence 
and needs to determine whether or not to retain a particular item im- 
mediately after it is observed. Any decision rule creates a set of items 
that are selected. We consider situations where the available informa- 
tion is the rank of a present observation relative to its predecessors. 
Certain "natural" selection rules are investigated. Theoretical results 
are presented pertaining to the evolution of the number of items se- 
lected, measures of their quality and the time it would take to amass 
a group of a given size. 

1. Introduction. Items (people) are observed sequentially, each receiving 
a score. The decision whether or not to accept an item must be made on 
the spot, without possibility of getting back to an item that has been let go. 
What would constitute a reasonable selection policy? 

The issues involved in formulating a policy are the quality and quantity 
of those selected and the size and rate at which the accepted set grows. 
Also of import is whether or not the horizon of the pool of items is finite. 
The scenario we envision is one where observations arrive in random order 
and their scores are independent and identically distributed, but nothing 
is otherwise known about their distribution so that, as items are observed, 
information about the pool of candidates is being gathered. Heuristically, 
quality of the items selected and speed of selecting items are in conflict with 
each other. A policy of selecting all items fulfills the need for speed, but the 
quality will be average. Toward the other extreme, declining to accept any 
item unless it is better than all of those observed previously will produce 
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a high-quality set of items, but its rate of growth will be very slow. In 
this paper we study certain policies that compromise between these two 
objectives. 

The procedure that accepts the first item and subsequently accepts only 
items which are better than all those observed previously is a well-studied 
policy (cf. [1] and [6]). 

Preater [5] studied a method that prescribes the acceptance of the first 
observation and subsequently accepts those observations that would improve 
the average score of those retained. Preater assumed that the scores are ex- 
ponentially distributed, and derived the asymptotic growth and distribution 
of the average score after n observations have been retained, as n — ► oo. Se- 
lection rules with known distribution of the items and inspection cost are 
considered in [4]. 

Other problems that have a similar flavor are variations of the secretary 
problem (cf. [2]), where one samples sequentially from a finite pool until one 
or several items are retained, after which sampling ceases, with the objective 
being the maximization of the probability of retaining the best in the pool. 

In this paper, we study sequential rules that are based on the ranks of 
the observations. At every stage, we (re-) rank the observed values from the 
best to the worst, so that the best has rank 1. We consider procedures that 
retain the nth observation if its rank is low enough relative to the ranks of 
the previously retained observations. 

For the sake of illustration, consider the median rule, which prescribes 
the retention of the nth observation if and only if its rank is lower than the 
median rank of the observations retained previously (i.e., the median of the 
retained group will be improved) . Regarding the speed at which observations 
are retained, let L n be the number of items that are retained after n items are 
observed. We show that the expected number E{L n ) of observations retained 
after n have been observed is of order n 1 / 2 and that L n /n 1 / 2 converges 
almost surely to a nondegenerate random variable. Regarding the quality 
of the retained observations, we show that at least half of the observations 
retained are the very best of all observed heretofore, and that the average 
rank of the retained observations and its expectation are of order n 1//2 log n 
(implying that almost all of the retained observations are very good). 

The paper is organized as follows. In Section 2 we introduce a general class 
of rules that are characterized by a criterion that ensures that the probability 
that item n + 1 is retained is a simple function of its rank and the number 
of items L n that have already been retained. In Section 3 we specialize and 
consider rules that retain an item if it is among the best lOOp percent of the 
items already retained. In Section 4 we show that E(L n )/n p converges and 
that L n /n p converges almost surely to a nondegenerate random variable. In 
Section 5 we find the order of the expected value of the average rank of the 
observations retained by the p-percentile rule and that suitably normalized, 
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the average converges a.s. to a nondegenerate random variable. We end with 
remarks and conclusions in Section 6. 

2. A class of selection rules. As stated in the previous section, our focus 
in this article is on selection rules based on ranks. In this section we introduce 
a class of selection rules that retain an observation if its rank is "low enough," 
where the threshold of "low enough" is determined solely by the size of the 
set of observations already retained. The rationale for this has to do with the 
trade-off between the quality of retained observations and the speed of their 
accumulation. Heuristically, the more observations retained, the slower one 
would go about retaining further observations, so the size of the retained set 
should be a factor in the selection rule. On the other hand, one's evaluation 
of the quality of an observation depends on all past observations, not only on 
those retained so far, and one's expectations regarding future observations 
is the same irrespective of the quality of those already retained. Therefore, 
there is good reason to require a selection rule to depend only on the size 
of the retained set of observations and the rank (among all observations) of 
the present observation. (As for the desire to "improve" the set of retained 
observations, heuristically, the quality of the retained set is correlated with 
its size, so at least qualitatively "improvement" is implicit in retained set 
size. We examine this more formally in Theorem 2.1 and Remark 2.1.) 

Formally, let X\ , X2 , ... be a sequence of observations so that any ordering 
of the first n observations is equally likely. A sufficient condition is that the 
random variables be exchangeable. A special case that satisfies this assump- 
tion is when we have independent and identically distributed (i.i.d.) random 
variables from a continuous distribution. Let S n be the set of indices of the 
retained X's after n items have been observed and let L n be the size of S n . 
Let i?" be the rank of the ith observation from among X\, . . . ,X n , that is, 
R i = E"=i < x i) where I {A} is the indicator function of A. Thus, R% 
is the rank of X n within the set {Aj}" =1 , where without loss of generality 
we assume that "better" is equivalent to "smaller" so that rank 1 is given 
to the smallest observation, rank 2 to the second smallest, etc. A selection 
rule of the type we study is defined by an integer- valued function r(-) on 
the integers such that the observation X n will be retained if and only if 
i?" < r(L n _i). In this article, we assume that the first observation is always 
kept. 

Another feature of a reasonable selection procedure is to require that the 
function r(-) be locally subdiagonal, that is, r(a + 1) < r(a) + 1. Again, the 
rationale for this has to do with the trade-off between the quality of retained 
observations and the speed of their accumulation. [To see this, suppose a 
observations have been retained after n have been observed. The rank of the 
next retained observation will not exceed r(a). The rank of the succeeding 
retained observation will not exceed r(a + 1). If r(a + 1) > r(a) + 1, it would 
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mean that after having retained a + 1 observations, one would be willing 
to settle for an observation of lower quality than the acceptance level after 
having retained o observations. Although that may be reasonable in a case 
that a quota has to be filled and the pool of applicants is finite, that is not 
the case we regard here.] 

We summarize the above in the following definition. 

Definition 2.1. A locally subdiagonal rank selection scheme {LsD) 
is a rule determined by an integer- valued function r(-) with the following 
properties: 

(i) r is nondecreasing. 

(ii) r(0) = 1 and L = 0. 

(iii) r is locally subdiagonal, that is, r(a + 1) < r(a) + 1. 

(iv) For n > 1, X n is retained if and only if R™ < r(L n _i). (This implies 
that the first observation is retained.) 

This class contains many rules that make heuristic sense. For instance, 
the median rule is a LsD rule with r(l) = r(2) = 1, r(3) = r(4) = 2, and 
generally r(2j — 1) = r(2j) = j. A class of LsD rules is "/c-record rules." For 
a fixed value k, let r(j) = min{j + 1, k}. For k = 1, this is the classical record 
rule, where an element is retained if and only if it is better than all previous 
observations, "/c-record rules" have been studied extensively (cf. [3] or [6] 
and many subsequent papers). 

The following theorem is trivial for "fc-record rules" but is true for any 
LsD rule. It attests to the high quality of the set of observations retainable 
by a LsD rule. 

Let N > 1 be any integer either predetermined or random. For example, 
N can be a stopping rule. A special case of interest is inverse sampling, 
where the objective is to collect a group of some fixed size m, so that 

N = inf{n : L n = m}. 

Theorem 2.1. Consider a LsD rule defined by r(-). The r(Ljy) best 
observations among X\, X2, ■ ■ ■ ,Xn belong to Sn- 

Proof. Let X m be the tth best observation among Xi, X2, • • • , Xn with 
t < r{L^)- Let a be the number of observations among X\, X2, ■ ■ ■ ,X m _\ 
that are better than X m and let b be the number of observations among 
X m+ i, . . . , Xm that are better than X m . Clearly, a + b = t — 1. 

If m^L Sjy, then the next items retained after iteration m must be better 
than X m . This implies that L^ < L m ^\ + b. Hence 

r{L N ) < r(L TO _i + b) < r(L m _i) + b. 
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But m £ Sn implies that a + 1 > r(L m _i). Since by assumption t < r(Ljy), 

a + b + 1 = t < r(L N ) < r(L m _i) + b 
so that a + 1 < r(-L m _i), which contradicts the inequality two lines above. 



Remark 2.1. Because of Theorem 2.1, implicit in the definition of a 
LsD rule is that it "improves" the retained set. For example, when applying 
the median rule, the median of the retained set gets better, something that 
is not transparent when regarding the median rule via its LsD definition. 
Theorem 2.1 is a more formal presentation of the heuristic stated in the 
beginning of this section, that the quality of the retained set is correlated 
with its size, and a LsD rule embodies all three heuristics: (i) the larger 
the retained set, the slower one goes about retaining more observations; (ii) 
perception of quality is founded on all previous observations; (iii) one only 
retains items that "improve" the retained set. Theorem 2.1 means that there 
is no contradiction between the third heuristic and selection based on the 
size of the retained set only. 

A natural representation of the quality of the group of observations kept 
(when retention is by ranks) is the average rank of the observations retained. 
Denote by Q n the sum of the ranks of the retained set after n observations 
have been made, so that the average rank A n is Q n /L n . 

Lemma 2.1. Let J- n be the a-field generated by the ranks of the i.i.d. 
continuous random variables X\, . . . ,X n . Let Q n = ^2i£s The condi- 

tional expected behavior of the quantities L n+ i,Q n+ i and the average rank 
A n+ i given the past, in terms of the corresponding quantities for n, results 
in: 



Proof, (i) It follows that L n+1 \L n = L n + B(^^-) where B(x) is a 
Bernoulli random variable with probability x. Hence (i) follows by taking 
conditional expectations on both sides. 

(ii) The sequence {Q n } is nondecreasing. Its growth can be described as 
follows. If X n+ \ is retained, and it has rank k among the items retained, 
then X n+ \ adds k to the sum of the ranks and 1 for each observation that 
is inferior to it. Hence, Q n +i = Q n + k + [L n — (k — 1)] = Q n + L n + 1. When 
X n+ \ is not retained, then the rank of some of the lower-quality retained 
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observations can increase (by 1, if X n+ \ has lower rank). Note that the dis- 
tribution of the rank of X n +i (conditional on T n and its not being retained) 
is uniform over r(L n ) + 1, . . . , n + 1. Therefore, for n > 1, 



E{Q n +l\Fn) 



{Qn + L n +l) 



r(L r 
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n + 1 - r(L n 
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(hi) If X n+ i is retained, then L n +i = L n + 1 and if X n +i is not retained, 
L n +i = L n - Therefore, the same argument as in the proof of (ii) leads to 

E{A n+1 \T n ) 
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Since Lemma 2.1 is central to the derivations in the sequel, we will hence- 
forth assume without loss of generality that the random variables X\ , X2 , . . . 
are i.i.d. and continuous. 

3. Percentile rules. In the following sections, we consider rules that re- 
tain items if the item is among the best 100» percent among those items 
that have already been retained. 

Definition 3.1. A p-percentile rule, for p fixed (0 < p < 1) is a LsD 
rule with r{k) = \pk~\ for k > 1, where \x] is the smallest integer that is 
greater than or equal to x. Thus, the nth item is retained if and only if its 
rank satisfies -R™ < [pL n _i] . 

To see that the p-percentile rule is a LsD rule, note that \p{a + 1)] = 
\pa + p] < \pa + 1] = \pa] + 1. 

Remark 3.1. Note that the p-percentile rule is meaningful even when 
p = 1. In that case, the first observation is kept. The second is kept if it is 
better than the first observation. In general, an item is kept if it is better 
than the worst item that has already been retained. It is easy to see that 
when»= 1, E(L n \L n _i) = L n _\ + L n _\/ n. It is straightforward to show that 
E(L n ) = 2±1. Hence E(L n )/n 1/2. Also, since E{L n \L n ^) = *^L n - X , 
it follows that L n f (n + 1) is a bounded positive martingale, and therefore 
converges almost surely. Since the worst item that has already been retained 
is obviously X\, it follows that L n /n is asymptotically U(0, 1). 

4. Results for the number of retained items. In this section, we study 
the behavior of the number of items that are retained after n items are 
observed, L n , for p-percentile rules. It turns out that L n is of order n p . 
Hence we consider the quantity L n /n p . We first show that the expectation 
of this quantity converges to a finite limit. We then show that this quantity 
itself converges almost surely to a nondegenerate random variable. 

The first result we present is that E(L n )/n p — > c p as n — > 00. For example, 
this result says that the rule that retains items if they are superior to the 
median of all items already retained, will be keeping on the order of y/n 
items on the average. The constant c p depends on 

d n = E(\pL n ] -pL n ). 

The relationship between c p and d\, d>2, ■ ■ ■ is complicated because it depends 
on all of the dj . It seems impossible to determine c p analytically, except for 
p = 1, as done in Remark 3.1. 
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The result, however, only requires that we show that d n is bounded away 
from zero. This result is intuitive. For the median rule (p = 1/2), d n is simply 
P(L n is odd)/2. Logically, we would expect (it turns out to be justified by 
empirical analysis) that P(L n is odd) — > 1/2 as n — > oo. This is not easy to 
prove. Similarly, if p = 1/4, then \pL n ~\ — pL n is either 0, 3/4, 1/2 or 1/4 
depending on whether L n (mod4) is j for j = 0, 1, 2 or 3, respectively. Since 
logically each of the four cases should be equally likely (again this appears 
to be the case by computer analysis), we would anticipate that d n — > 3/8. 
[We conjecture that if p is an irrational number, then [L n p] — L n p converges 
to U(0, 1) which implies that d n — > 1/2.] 

The following lemma shows that d n is bounded away from zero. 

Lemma 4.1. Let < p < 1 be fixed, and e = e v = min{|, ^-^}. Then 
d n > e/3 for all n. 

PROOF. Let S £ = {j | \pj~\ - pj < e}. Note that if j G S £ , then 

• j — 1 ^ S e . This follows since e + p < 1, thus \p(j — 1)] = \pj~\ . But then 

\p{j - 1)1 - pU ~ 1) = fail ~pj + P>P> £■ 

• j + 1 ^ /Sg. This follows since p — e > 0, thus [p(j + 1)] = \pj~\ + 1. Hence 

\p{j + 1)1 - P(j + 1) = fail - PJ + 1 - P > £• 
We will show that for all n > 2 and all j = 1, 2, . . . , 

(4.1) P(L n = j + 1) + P(L n = j - 1) - P(L n = j) > 0. 

This will yield the lemma since clearly (4.1) implies J2jes s P(^n = j) < 2x 
J2j<^s £ p ( L n = j), which in turn implies that T,j^s e P ( L n = j) > 1/3 so 
> e/3. Note that (4.1) is trivial for j > n. 

We prove (4.1) by induction. For n = 2 and all < j> < 1 we have P(L2 = 
1) = P(L 2 = 2) = 1/2. Thus (4.1) holds for j = 1, 2 and n = 2. 

Now assume (4.1) holds for 2, 3, . . . , n — 1. We shall show it holds for n. 
Consider first the values of j for which 

(4.2) 2\pj-]/n<l. 
Clearly 

(4.3) P(L n =j - 1) > (1 - fa(j - 1)1 /n)P(L„_i = j - 1), 

P(L n = j) = - l)l/n)P(L„_i = j - 1) 

(4.4) 

+ (l-fail/n)P( J L n _i=i), 
P(L n = j + 1) = ( fajl /n)P(L n „x = j) 

(4.5) 

+ (i-rK7 + i)l/™)P(£„-i=i + i). 
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Thus 

P(L n = j + 1) + P(L n =j-l)- P(L n = j) 
> P(L n _i = j + 1) + P(L n _! = j - 1) 
(4.6) - P(L n _x = j) + 2( fpj] /n)P(L„_i = j) 

-2(rp(i-l)l/n)P(L„_ 1= i-l) 
-(rp(j + l)l/n)P(L n _ 1= i + l). 

However, — 1)] < [pj] and [~p(j + 1)] < 2\pj~\ as \pj~\ > 1. Hence, the 
right-hand side of (4.6) is greater than or equal to 

(1 - 2\pj]/n)[P(L nr . 1 =j + 1) + P(P„_i = j - 1) - P(L n _x = j)] 

(4.7) 

> 

where the last inequality in (4.7) follows from (4.2) and the induction hy- 
pothesis. 

Now consider values of j (if such exist) for which 
(4.8) 2\pj]/n>l. 
Then clearly j >1. Replace (4.3) by 

P(L n = j-l) = (l- \p(j - l)] /n)P(L n _i = j - 1) 

(4-9) 

+ (\p(j-2)]/n)P(L n ^=j-2) 

and replace (4.5) by 

(4.10) P(L n =j + 1) > (\pj]/n)P(L n ^=j). 

Then by (4.9), (4.4) and (4.10), it follows that 

P(L n =j + l) + P(L n =j-l)- P(L n = j) 
>(2\pj]/n-l)P(L n _ 1 =j) 

(4.H) 

+ (\p(j-2)yn)P(L n -i=j-2) 

- {2\p{ 3 - 1)1 /n- l)P(L n _! =j - 1). 

If 2\p(j — 1)1 /n < 1, then by (4.8) clearly the value in the right-hand side 
of (4.11) is nonnegative. If 

(4.12) 2\p(j-l)]/n-l>0, 
we shall show that (4.12) implies 

(4.13) \p(j - 2)l/7i > 2\p(j - l)] jn - 1 
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so that the right-hand side of (4.11) is greater than or equal to 

(2\p{j - 1)1 /n - l)[P(L n _! =j) + P{L n - 1 =j - 2) - P(L n _ x = j - 1)] > 

where the last inequality follows from (4.12) and the induction hypothesis. 
To see (4.13) note that \p{j - 2)] > \p(j - 1)] - 1. Thus (4.13) will follow if 
we show that (\p(j — 1)] — l)/n> 2\p(j — l)]/n — 1 which is equivalent to 

(4.14) n-l>[p(i-l)"|. 

Since j < n are the only values of interest, we have j — 1 < n — 1, for which 

(4.14) clearly holds. □ 

We now turn to the main result of showing that the average number of 
items that are retained is of order n p . From Lemma 2.1(i), 

E{L n \L n -\) = L n -i + \pL n -{\/n. 

Hence, 

E{L n \L n -i) =L n -! +pL n _ 1 /n + (\pL n _{\ -pL n -.\)/n. 
Let M n = E(L n ). Then 

(4.15) M n = M„„i(l+p/n)+d n _i/n. 
We are now prepared to state and prove: 

Theorem 4.1. Let < p < 1 . E(L n ) j-riP — >• c p as n — > oo with < c p < 
oo. 

Proof. By Remark 3.1, c\ = 1/2. Thus consider a fixed p, <p< 1, 
and let T n = M n /n p . From (4.15) we have that 

(4.16) T n = ((n - l)/n) p (l + p/n)T n ^ + d n „!/n 1+ P. 

The key to the proof is showing that A n = T n — T n _i eventually becomes 
positive and remains positive. Since T n = Y7j=i with Tq = and T n will 
be shown to be bounded, it follows that T n converges. 
By the definition of Aj and (4.16), 

(4.17) Aj = bjTj-i + dj-i/j 1+p 

where bj = ((j - l)/j) p (l +p/j) - 1. 

The basis of the proof is in the result that bj < and increases to as 
j — ► oo. This is a straightforward calculus argument. 

Let x = l/j and f(x) = (l-x) p (l+px) - 1. Thus, f(0) = 0. Also, f(x) < 
by routine calculus. 
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From bj < and (4.17) it follows that 

n 1 rn 

(4.18) T„<l + £ TO <l+/_ (l/x) 1+p dx<2/p 

j=2 3 Jx-1 

so T n is bounded. To show that A n is eventually nonnegative note that 
[by (4.18)] A n > «-> T„_i < — n i" ■ It is again a straightforward calculus 

argument to show that — i+ Pb > oo. Since by (4.18) T n < 2/p, for all n, 

that coupled with Lemma 4.1 will complete the proof. Consider 



j i+p . 



j 

Again, let x = 1/j and so —j 1+p bj becomes 

g(x) = [1 - (1 - x) p (l + px)]/x 1+p . 

We need to show that g(x) — > as x — > 0. This follows easily by l'Hospital's 
rule. □ 

We just showed that E{L n /n p ) converges as n — ► oo. Next we show that 
L n /n p has an almost sure limit. We prove this by showing that L n /{n + l) p 
is a (positive) submartingale and that E(L^/n 2p ) is bounded. 

Theorem 4.2. \mi n ^ O0 E(L' 2 n /n 2p ) exists and is finite. 

Proof. Let U n = E(L 2 l /n 2p ). We first show that there exist constants 
< c\ (p) < C2 (p) < oo such that for all n > 1 

(4.19) Cl (p)<U n <c 2 (p). 

The left-hand side of inequality (4.19) follows trivially from Theorem 4.1, 
since U n > (E(L n )/n p ) 2 — > c 2 . For the right-hand side inequality of (4.19), 
note that 

- 2 I -jr {-> \P L n-l]\ . IT . r , 2 \P L n-l] 



E(L 2 n \T n ^) = Li_ x 1 - + (L„_x + 1 



n J n 



(4.20) <L 2 „_ 1 + (2L n _ 1 + l)? L "" 14 



n 

-2 



Thus 



n ) n n 



c/ „< CJ „_ 1 i^ 2 V 1+ ^ + E(L "-,'j'! +2) + 1 



n J \ n J n i+2p n i+2p 
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Therefore, 

(4.21) 1UM U) 

, £(L n _i/(n-l)P)(p + 2) 1 



Note that since f(x) = (1 - x) 2p (l + 2px) - 1 satisfies /(0) = and /'(x) < 
for x > 0, it follows that (^) 2p (l + f ) - 1< 0. Since £(L n _i/(n - If) is 
bounded, it follows from (4.21) that (with Uq = 0) 



const 



U n = J2( u j ~ Uj-i) < E -tw < °°' 



J'=l J=l 



which accounts for (4.19). 

Now denote Aj = Uj - Uj-i, so that U n = YJj=i ^j- % virtue of (4.19) 
to complete the proof it suffices to show that A,- > for all j sufficiently 
large. By (4.20), 

E{Ll\T n ^) = LU + 2(L n _ x + l)M=zll 

n 



>Ll_ l + {2L n ^ + l) P - 



n 



Thus 



Now for some < < 1, by Taylor's theorem, 

2P = (\ _ = i _ BP + P(2p~l) ^ _ ^ - 2(1 " P) 

Hence there exists a constant 03(7?) > such that for all j ' > 1 



j j \ 3 j j j 2 

Also, there exists a constant c^{p) > such that E(Lj—\/j p ) > c<±(p) for all 

j > 1. But then, for all j sufficiently large, A,- > ~ C3(p)c2(p) -t pC4(p)il " P > 0. 
□ 



Corollary 4.1. linin^oo Var(L n /n p ) exists and is finite. 

Let j n = [pLnl be the cutoff rank such that the (n + l)st item is retained 
if and only if its rank from among the first n + 1 observations is less than 
or equal to j n . 
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Theorem 4.3. L n /(n + l) p is a submartingale that converges almost 
surely as n — > oo to a nondegenerate finite random variable A such that 
lim^oo E(L n /(n + l) p ) = EA = c p , for allO<p<l. 



PROOF. Since j n = \pL n ] , 

jn-l 



so 



E{L n \J- n - 



n 



(L n _l + 1) + 1 



Jn-l 



n 



L 



n-l 



L 



E 



L, 



(n + l)P 



•F n- 



> 



> 



n-l 



Ln-l 

nP 

L n -\ 
nP 

L n -\ 
nP 



1 + 



n 



n 



+ 



Jn-l -pL n -i 



n 



n+1 

n 



1 + 



n 



+ 



j n -! - pL 



n-l 



n{n + l)P 



n+1 



1 + 



P 



v 



Therefore, L n /(n-\- l) p is a positive submartingale. Because E(L n /(n + l) p ) 
and E(l? n l (n + l) 2p ) are both bounded (by virtue of Theorems 4.1 and 4.2), 
Theorem 4.3 follows from the submartingale convergence theorem. □ 



5. The quality of the retained group of observations acquired by a 
p-percentile rule. In the previous sections, the focus was on the size of 
the group retained by the p-percentile rule. Here, attention is focused on its 
quality. 

In general p-percentile rules yield a qualitative crop. A prime indication 
of this is Theorem 2.1 — after n observations of which L n have been retained, 
the best [p-L n ] of all n observations seen heretofore are among the retained 
set. As will be shown below in this section, the other retained observations 
are generally also of high quality. 

To this end, the following theorem considers the average rank of the re- 
tained items A n , which equals Q n /L n . 

Theorem 5.1. There exist constants < b p < oo such that for < p < 1, 
E{A n )/a n {p) — > b p , where 



( n 1 -* 
a n(p) = I n 1/2 logn, 
[nP, 



if p< 1/2, 
if p= 1/2, 
if p> 1/2, 
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Cl/2/8, 
p 2 



2(2p - 1) 



Cp 1 



if p = 1/2, 
if p> 1/2, 



where c p is the limit of E(L n /n p ). 



Proof. From Lemma 2.1(iii), with r(L n ) = j n = \pL r 



1 + L 



n Jn 



(5.1) E(A n+1 \T n )=A n 1 + 

L (re+l)(L n + l) 

Let Y n = A n /n l ~ p '. Equation (5.1) implies 

(5.2) E(Y n+l \T n ) = G n Y n + B 
where 



+ 



jn(jn-l)/2 
(n + l)L n 



n 



n + 1 



i-p 



1 + 



and 
(5.3) 



B,, 



1 + L n — j n 
(n + l)(L n + l) 

j n (j n - l)/2 



L n {n + lf-P' 

We consider B n first. Since pL n < j n < pL n + 1, 

(p 2 L n -p)/2 (p 2 L n +p)/2 
<B n < 



(n+1) 



2-p 



(n+1) 



2-p 



By Theorem 4.1, E(L n /n p ) — > c„, which implies 

n— >oo * 

(5.4) E(B n )n 2 ~ 2p — p 2 c p /2. 

We consider G n next. Let e n = pL n + p — \pL n ~\ , so that 
1 + L n — j n 



(5.5) 



L n + 1 

l-p 
n+1 

G n = l + 



Since (^x) 1 ^ = 1 - + 0(£) and since \e n \ < 1, 



+ 



1 



n- 



(L n + l)(n + l) 
Substituting (5.5) into (5.2) yields 

_(L n + l)(n+l) +U {n 2 
After taking expectations in (5.6), it follows that 



(5.6) E(Y n+1 \F n ) = Y n + 



(X 



Y n + B n . 



(5.7) E(Y n+1 ) = [E(Y m +i) ~ E(Y m )] = £ E(D rn ) + ]T E(B m ) 



m=0 



m=l 



m=l 
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where D m = [ (Lm+ e ff (m+1) + 0{^)]Y m and Y = 0. 

Our aim is to show that X)m=i^(^m) an d Em=i^(^m) (or variants 
thereof for p > ^) have finite limits as n — ► oo. For the first sum, since 
L m < m, it is sufficient to show that E(J2m=i (l +Y)(m+i) ) nas a fi^te limit. 
Now 

El -, ^ =E ( Am 



(L m + l)(m + l)J \m 1 -P(m + l)(L m + l) 

< {E[A m ^-I{L m >m £ })+ E(A m I{L m < m £ }) 

By virtue of Lemma A.l (in the Appendix) there exists a constant < 
c £)P < oo such that, for < e < 1/2, 

P(L m <m £ ) < — - — rf'f. . ; for all 1 < m < oo. 
v y - m l-ri+l/p|e - 

Note that A m < m. Therefore choosing < e < (1 — V)llp (with 7 p = 
[1 + i]), it follows that 



p 



<^^E(A m ) + c eiP /m 2 -P-^. 



(L m + l)(m + l) 

We now divide the proof into three cases. 
Case (i): p < 1/2. 

(a) E™=i E(B m ) < oo by virtue of (5.4). 

(b) E™=i ^(^ m )/m 2 -P+ e = Em=l E[A m /m l -P +£ l 2 ]/m l+£ / 2 < oo by vir- 
tue of Lemma A. 2 (in the Appendix). 

(c) Clearly, £~ =1 c E , p /m 2 -P-^ < oo. 

Case (ii): p = 1/2. We need to divide both sides of (5.7) by log n. 

( a ) J2m=i E ( B m)Aogn — ^p 2 c p /2 = c 1/2 /8 by virtue of (5.4) since 
E(B m ) = (p 2 c p + e m )/2m where £ m ^0asm^oo and 2~2m=l — * 1- 

(b) E™=i i?(^ m )/m 2 -P +£ = Em=l J E[^ m /m 1 -f +£ / 2 ]/m 1 + £ / 2 < oo by vir- 
tue of Lemma A. 2 (in the Appendix). Hence 

E7n=iE(Am)/m 2 -P+ £ 



logn 

(c) Clearly, 

E n m =iCe, P /m 2 -P-^ 



as n — > oo. 



logn 

(since the numerator is summable). Hence 

E(A n ) 



— ► 

n— »oo 



n 1 / 2 logn n^oo 



Cl/2/8. 
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Case (iii): p > 1/2. We need to divide both sides of (5.7) by n 2p ~ 1 . 

(a) YZ=i E(B m )/n 2p ~ l ^ ^£^ c p b y virtue of ( 5 - 4 ) since E ( B ™) 

(p 2 c p /2 + e m )/m 2 ~ 2p where e m -> as m -> oo and Em=i ^^/™ 1 ~ 2p 
l 



2p-l- 

(b) 

fj2p— 1 y^p— 1 n— >oo 

by virtue of Lemma A. 2 (in the Appendix). 

(c) For small enough e, L ^ m = 1 e, | p _ 1 > (since the numerator 

n n— >oo 

is summable). Hence 



nP n _^oo 2 (2p - 1) 



□ 



We now consider the almost sure convergence properties of the average 
rank of the items kept, suitably normalized. We shall need the following 
result, due to Robbins and Siegmund [7], quoted as follows: 



Proposition 5.1. Let (0,,{F,P) be a probability space and letT\ C^C 
• • • be a sequence of sub-a -algebras of J 7 . For each n = 1, 2, . . . , let z n , f3 n , £ n 
and Cn be nonnegative Tn-measurable random variables such that 

E(z n+ i\T n ) < z n (l + P n ) +£ n - Cn- 

Then lim n ^ 00 z n exists and is finite and Y^=i Cn < oo a.s. on {J2n°=i Pn < 
oo, E^i&i < oo}. 

The limiting behavior of A n , the average rank of the retained observations, 
depends on p. Theorems 5.2 ; 5.3 and 5.4 show the results for < p < |, \ < 
p < 1 and p = i , respectively. 

Theorem 5.2. IfO<p<^, then A n /n l ~ p converges almost surely as 
n — > oo to a nondegenerate random variable. 

Theorem 5.3. lf\ < p < 1, then V n q p where V n = Q n /L 2 n = A n /L v 
and q p = \p 2 j(2p — 1). Furthermore, E(V n ) — ► q p . 

Theorem 5.4. If p = \ then F n /logn 1/8 and E(V n /\ogn) — >■ 
1/8. 
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Proof of Theorem 5.2. We show that the almost sure convergence 
of A n /n l ~ p is the result of a direct application of Proposition 5.1 above. 

Regard (5.6). Note that B n of (5.3) can be written as B n = ^ 
where |0 n | < 1. Apply Proposition 5.1 to (5.6) with z n = Y n , £ n = B n , ( n = 
0, (3 n = {e n /[(L n + l)(n + 1)] + \0(n~ 2 )}, all nonnegative. Since L n /n v 
converges a.s., J2n°=i Pn < 00 an d X)m=i£m < oo a.s. The nondegeneracy of 
the limit follows from the fact that the first observations have rank of order 
n and their influence on A n /n l ~ p does not vanish as n — ► oo. □ 

In order to obtain results for p > 1/2 we need two lemmas. These lemmas 
describe the extent to which the sums of the ranks of the items kept Q n +i 
increase after the (n + l)st item is observed. If the (n + l)st item is kept, 
it is easy to see that Q n +i = Qn + L n + 1. The difficulty arises when the 
(n + l)st item is not kept. Even though the (n + l)st item is not kept, it 
might still be better than items that have been kept. 

Let X* be the values of the items that are kept after having observed n 
items for i = 1, . . . , L n where the items are indexed from smallest (best) to 
largest (worst). The sum of the ranks of the kept items after n + 1 items 
are observed, provided that the (n + l)st item is not kept, is denoted by 



Q n + A* n+1 where A* n+l = Et jn+ i H^n+i <X*). The behavior of A n+1 



captured as follows. 

Lemma 5.1. 
(5.8) E(A* n+1 \X n+1 >X* n ,F n ) 
and 



is 



Qn ~ (l/2)j n (j n + 1) ~ j n (L n ~ jr 

n + l-jn 
Q n {2L n -2j n + l) 



(5.9) E(AZ +1 \X n+1 >X* n ,F n )< 

' 1 ~r 1 Jn 

where j n = \pL n ~\ as above. 
Proof. 

E(I(X n+1 < X*)\X n+1 > X*^T n ) 



and 



n+l-jn 

for i > j n where Rf 1 is the rank of X* from among X\ ,.. . X v 



Ln 

(Ri ~ jn) = Qn — \jn{jn + 1) — jn{L n — jr 



hence (5.8) follows. 
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Since I(X n+1 <X*) = 1^ I{X n+1 < X*) = 1, for j>i> j n , 
E(A^ +1 \X n+ i > X* n ,T n ) 

= E(A n+1 \X n +i > X* n ,T n ) 



+ 2E 



L n —X L n 

]T J2 i(x n+ i<x*)\x n+1 >xi,F n 

i=jn+l k=i+l 



< E(A* n+1 \X n+1 > X* n ,F n )(2L n - 2j n + 1). 
So (5.9) follows from (5.8). □ 

Lemma 5.2. LetV n = Q n /L n . Then 



(5.10) E{V n+l \T n ) = V n (l-^-± + o(^- 

\ n+l \nL n 

Proof. 

E(V n+1 \F n ) 

Jn {°in 

+ L n + l 



n + l 



nLr 



(5.11) 



But 



n + l (L n + 1) 2 



+ 



n+l- j n 



Qn + 



(n + l)L n 

jn Qn + L n + 1 

n + l (L n + 1) 2 



Qn ~ (l/2)j n (j n + 1) ~ j n (L n ~ jn) 

n + l-j n 



J, 



-V 



+ 



Jn 



n + l n {L n + l) 2 n+lL„ + l 



(5.12) 



n + l 

_j 
n + l 



-VJ 1 



+ 



1 



V n + 



Vn 



n + l 



-2p + 



n + 1 L n + 1 
1 



L, 



+ 



1 



P + 0[ — 



n+l 

Also, the second term in (5.11) equals 



n + l- j n _ V n 

" v n + 



1 



(5.13) 



n + l 



n+l n+ 1 V2 



-p 2 +p(l-p) + — 



n + l- j r 
n+l 



V n + 



V, 



n+l n+l 



P-^P +0 



L, 



Li 
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Lemma 5.3. 

E{VZ + x\Fn) = VZ-^(2p-l + 0(±- 

(5.14) 



V, 



+ p 2 + o(tH )+o 



n + l\ \L n J J \nL n 

Proof. 

(Qn + L n + l) 2 



n + 1 (L„ + l) 4 

l n + l ) L n 



But 



(Q n + L n + 1) 



2 



(5.15) 



Hence, 



9l( 1 L-V+ ^ +0 (± 

L*\ L n + lJ + (L n + l)3 +U U 2 

v*(i 4 — + o(± 

+ 2Qn +o(± 

+ (L n + l) 3 U 



j n (Qn + L Tl + l) 2 

n+1 (L„ + l) 4 

-2 



n + 1 \ \L n J J \ nL r 

Similarly, by Lemma 5.1, 

(n + + A n+l) 2 \ X n+l > X j n >F n } 

n + 1- j n T , 2 , 2( 3n ^ 1 



+ 7 , -iwa Qn - rinOn + 1) ~ jn(^n ~ Jr 



n+1 n (n+l)i 4 V 2 
(5.17) 
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n + l~jn, r2 , 2^ n 2 2V n (I 2 



-V + 

(n + 1) n n + 1 n+l\2 



p z +p(l-p) + 



Li 



since by (5.9) the term involving E(A n+1 \X n+ i > X^ n ,F n ) is bounded by 
Q n (2L n -2j n + l) V n (J_\ 



(n+1) Li n+1 \L n 

Combining (5.16) and (5.17) yields (5.14). □ 



Proof of Theorem 5.3. The claim is trivial for p = 1, for then Q n = 
^L n (L n + 1). Suppose 2 < P < 1- Let V* = (V n — q p ) 2 . We shall show that 
V* -^0 a.s.: 

E{V:+i\Fn) = E(V 2 +l \T n ) ~ 2q p E{V n+1 \f n ) + gg. 
Lemmas 5.2 and 5.3 imply that 



E(v: +1 \r n )=v: 

(5.18) + 



2V 2 
n + 1 



2p-l + 1 ( — 



n + 1 

q P p 2 

n + 1 



p 2 + 2g p (2p - 1) + 2 



l + Os 



L, 



Notice that |F n - q p \ = W* n so that V n < W* n + g p . Hence < V n < V* + 1 



K 2 = V* + 2g p T/ n - q 2 . Recalling that q p = \p 2 j(2p - 1), we obtain [after 
algebra, applying (5.18)] 



E(v: +l \F n ) = [i 



2(2p-l)\ , 2(2p-lK / 1 



n + 1 



K* + 



(5.19) 



+ 



< 1 



Vn 



2 — + 



n + 1 
2 / ! 



Oi — 



n+1 V L n / n+1 ' \L n 
2(2p-l)+Q 2 (l/L 
n + 1 



Since L n /n p — to a finite random variable, there exists (a random) no such 
that I O2 ( — ) I < (2p — 1) for all n > no, so that for all n > no, 



(5.20) 



E{V* +1 \F n )<V: + 



O 
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But since L n /n p converges a.s., it follows that ^-)| is a.s. summable. 
Therefore, it follows by Proposition 5.1 that V* converges almost surely. 

We claim that V* —> 0. Suppose this is false and there exists a set A of 
positive measure such that limT^* > a a.s. on A, where a > 0. Since lim V^* 
exists, (5.19) can be rewritten as 

(5.2i) E(v: +1 \r n ) <(i- 2( y~ 1) N K* + °( 



n + 1 / \nL, 

For given < e < a, there exists (a random) n\ > no such that, on A, V* > 
a — e for all n > n\. Clearly P(A\J r ni ) > 0. From (5.21) it follows that 

n 

E(V:\F ni )= Yl ew-vu^ + v^ 

i=ni+l 

n -, 

<-2(2p-l){a-e)P{A\F ni ) £ + V £+c 

i=ni+l 

> — OO 

where c = E£„, + i|0(^)l<oo. 

This is clearly a contradiction since is nonnegative. Therefore, V* —> 
a.s., that is, V„ — > g p a.s. 

By virtue of Lemma A. 2 in the Appendix, there exists rj > such that 
< Therefore it follows from (5.20) that {E(V n )*} is bounded. 
V^j — ► q p a.s. now implies EV n — > q p . □ 



Sketch of proof of Theorem 5.4. Let V* = - |) 2 . 
Ignoring O(-) terms, it follows from (5.10) that 



E 

(5.22) 



Vn lQgn | 1/8 



log(n + 1) / lognlog(n + l) nlog(n+l) 

Vn 1 \ + 1/8 



lognv nlog(n + l)/ nlog(n + l) 



The last expression comes from log nj log(n + 1) = 1 — nlog (" n+1 ) + 0( n zio gn ] 
Similarly, from (5.14), 



E 



V 



2 

n+1 



log^n + l) 
V 2 



1 



log n V nlog(n+l) 
(5.23) 
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+ 



1 V n 



1 



4 logn nlog(n+l)\ nlog(n+l 



1 



1 



V 2 



2V 2 
* n 



1 



log n log nn log(n + 1) 4 logn nlog(n + 1 



V 2 

v n+l 



But 

(5.24) E(V: +1 \F n )=E 

log [n + 1 

Substituting (5.22) and (5.23) into (5.24) yields 

E(v: +1 \r n )^v: 

= V* 



■F n 


-Ie 




•F n 


log(n + 1) 




4 





+ 



64 



1 



nlog(n + 1) \logn 8 
2 

V*. 

nlog(n + 1) n 



The remainder of the proof follows in a fashion similar to the proof of The- 
orem 5.3. □ 

6. Discussion. An extensive simulation study has been performed with 
varying horizons up to 10,000 and also 10,000 replications. The results of 
this simulation will be published elsewhere. The estimated constants for the 
limit of E(L n /n p ) (see Theorem 4.1) and E{A n / a n {p)) (see Theorem 5.1) 
are presented in Table 1. 

The standard errors for estimation of the constants are less than 0.002. 

Remark 6.1. Some of the results of the preceding section carry over to 
the inverse problem of fixing the number of items kept and considering the 
number of observations required until this goal is achieved. Suppose that a 
p-percentile rule is applied. Let Zi (for i > 1) be the number of observations 
made from the instant that the size of the set of retained observations became 
i — 1 until its size became i. Also, let A^ n = Ya=i be the number of obser- 
vations made until n have been retained. The results stated in Theorems 4.3 
and 5.2, 5.3 and 5.4 carry over directly to iV n , and Qyv n : L^ n /N p = n/N? 
converges a.s. as n — ► oo to a finite, nondegenerate random variable. For 
p < h, p=\, p> \ the quantities 



Table 1 





a„(p) 




*/p<0.5, a„(i) 


= n 1 / 2 


logn, a„ 


( P ) = n p 


ifp>0 






p 


0.1 


0.2 


0.3 0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


1.0 


E{L n /n p ) a 
E(A n /a n ) a 


4.178 
0.238 


2.674 
0.401 


2.111 1.653 
0.578 0.967 


1.181 
0.214 


1.198 
0.978 


1.045 
0.634 


0.841 
0.449 


0.693 
0.351 


0.500 
0.250 



"Estimate of the limit using n = 10,000 and 10,000 replications. 
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(a) ^/Nt P ,^/(N?/ 2 logN n ),^/NP, 

respectively, converge a.s. as n — > oo to nondegenerate random variables, 
and for p > i , 

W) Q N Jn^y/[2(2p-l)] a.s. 

For < p < 1 and n > 2 the expectation of iV n is oo [since E{Z<i) = oo]. For 
p<2,P=2)P > 2' respectively, the corresponding expectation of each of the 
three expressions in (a) converges to finite positive constants as n — > oo, and 
for p > \ the expectation of the term on the left-hand side of (/?) converges 
to the value in the right side, as n — > oo. 

Remark 6.2. Define Rf = l + E"=i I{ x j < x i)- ( Tni s does not consti- 
tute a change when the X's are i.i.d. continuous.) Then Theorem 2.1 and 
its proof hold verbatim, even if there are ties among the X's (which may be 
the case if the X's are discrete random variables), and even if the X's are 
not random. 

Remark 6.3. Other rules can be evaluated in a manner similar to the 
p-percentile rules. For example, for /c-record rules using Lemma 2.1, it can be 
shown that L n /logn converges almost surely to k as n — > oo and E(L n ) / log n 
also converges to k. It can be shown that Q n /{ n + 1) is a (nonnegative) sub- 
martingale that converges a.s. as n — > oo to a nondegenerate random vari- 
able, and E(Q n )/(n + 1) — > k. Thus A n log nln converges a.s. to a nonde- 

n — >oo 

generate random variable. 

Remark 6.4. The complexity of the sorting problem is of order nlogn. 
Sometimes, one is interested in retaining (in sorted form) only some of the 
best observations rather than the whole set. In this case, a p-percentile 
rule with < p < 1 obtains a sorted set of best observations, and the com- 
plexity is of order n. To see this, note that initially each observation has 
to be compared only to the p-percentile of the retained set — amounting to 
n operations — and each retained observation must be compared to (roughly) 
100p% of the retained observations, amounting (at most) to another 
2X^j=i l°g(j + l) = 0{L n \ogL n ) = O p (n p log n) operations (since the retained 
set can be stored in sorted condition). 

APPENDIX 

Lemma A.l. For any p-percentile rule with < p < 1 and any < e < 1, 
there exists a constant < c £iP < oo such that 

P(L n <k)< Cs^'^/n 1 ^ 

where ro = |~l/p] • 
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Proof. After m—1 observations have been retained by the p-percentile 
rule, let Z m denote the number of additional observations until the next 
retention. Note that Z\ = 1. Let N n = Ya=i be the number of observations 
made until n items have been retained. Thus, 

(A.l) P(L n <k) = P{N k >n) = P{Nl~ £ > n l ~ e ) < E(N^ £ ) /n l ~ £ . 

Without loss of generality, assume that the X{ have a U[0, 1] distribution. 
Let Xf denote the observation with rank i among X\, X2, • • • , X n . Note that 
conditional on X%, . . . , X^ m , the distribution of Z m+ \ is 

Geometric p with p = A"f m = X?" 1 -,. 

Also note that conditional on N m , the distribution of Xr^i 1S Beta ( \pm~\ , 
N m + 1- \pm\). 

Therefore, for < e < 1 

E(Z^ 1 \N m ) = E[E(Z 1 ^ 1 \N m ,Xf p 2 ] )\N m ] 

< E([E(Z m+1 \N m , X^)f~ e \N m ) 



= E 

(A.2) 



1 



1-e 



N m 



(\pm\ - l)\{N m -\pm\)\ 

T(\pm] + e- l)T(N m + 1 - \pm\) 

x r(iv m + e ) 

N m \ T{\pm\+e-l) 



{\pm]-l)\ T(N m + e) 



< N m 



\pm\ + e — 1 

The last inequality in (A.2) follows since f(A) = T(A + e) / (T(A) is increasing 
in A for any integer A. This is easily seen since f(A + l)/f(A) = (A + e)/A > 
1 for all £ > 0. 

For e = and \pm\ > 1 obtain E(Z m+ i\N m ) < N m /{\pm\ — 1), so that 
E(N m+1 \N m ) < J^L- N m . 

Hence 

E(N^ + \\N m ) < [E(N m+1 \N m )}^ < ( J^L^ ^N^. 



SELECT SETS 25 
Letting m p be the smallest m such that \pm\ > 1, it follows that 

(A-3) m^ + \\N mp )<Ni-; f[ (j^y) 1 "- 

We first show that E(N^ £ ) is finite. By virtue of (A. 2), since Z m+ i > 1, 

^(iv^iiv^) = s((iv m + z^+i) 1 - 5 !^) 

< J E(iv^|iv rn ) + i5;(^- £ 1 |iv m ) 

<N^ + \E{Z x ^P\N m )\^ 



< _/v 1_e + 



l-e/2 



(\pm\ +e/2 - 



This recursive relation can be applied repeatedly. Since E(N^ £ ) = 1, it 
follows that E(N^~ e ) is finite for all < e < 1. 

Finally, note that ^jfp?[ > 1 an d can appear (in the product n^mp \p(]-i ) 
at most ro = times. Hence, 

n( T JfL) 1 - e £ ( T ^)" <1 -^ M -. 

For m = k, the conclusion follows from (A.l) and (A. 3). □ 

Lemma A. 2. Lei < e < 1 and consider a p-percentile rule with <p< 
1. Lei p* = max(j>, 1 — p). T/ien 

lim El -4=-] =0. 



oo V nP* Jr£ 

Proof. The statement is trivially true for p = 1 as E(A n /n) — > 1/4 as 
72 — ► oo. Thus consider < p < 1. Let 1 < k £ < oo be an integer such that 

1+L r; jn <(i- P )(i +e )< p «(i+ e ) 

whenever L n > k £ , where j n = [pL n ] . 

From Lemma A.l we obtain 

P(L n <k e )<cl pM Jn l - £ . 
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Using Lemma 2.1 (iii), 

1 + L n - j n \ j n {jn ~ l)/2 



E{A n+1 \T n )=A n [l + 
(A.4) < An + A n 



+ 



(n + l)(l + L n )J (n + l)L 
1 + L n — j n 



(n + l)(l + L n ) 



I{L n > M + I{L n < k e }} 



+ 



P 2 L n + p 



2(n + l) 

Since A n < n, the right-hand side of (A.4) is less than or equal to 
p*(l + e)- 



A n 



1 + 



n 



n + 1 2(n + 1) 



Hence, with £<{pA c p } and large enough n, 



£?(Ah-i) < E(A n ) 
< E(A n ) 



1 P*(l + g) 



71 



1 + 



77 



c e,p,fc e . P 2 (g + Cp) 

n 1 ^ 2n 1 -P 



+ 



77 



1-p- 



Let 71 = 1 and define 7 n +i = 7n/[l + p i ]• Thus, {7 n } is a decreasing 
sequence, and there exists < 700 < 00 such that 

lim n p *( 1+£ )7 n = 7oo . 



n— too 

Note that for large enough n, 



2c p 7o 



ln + iE{A n+1 ) < ln E(A n ) + ^_ p+ p ;,7 1+£) 



!7' 



Because 1 — p + p* > 1, it follows that 



limsup7 n i?(A ri ) < co, that is, limsup£M — «, , < 00. 



Hence 



lim E 

n— >oo 



0. 



□ 
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